Amino acid substitution matrices.
نویسندگان
چکیده
The BLOSUM (BLOck SUbstitution Matrices) matrices were derived by Steven and Jorja Henikoff in 1992 1. They were based on a much larger data set than the PAM matrices, and used conserved local alignments or “blocks,” rather than global alignments of very closely related sequences. In order to account for different degrees of sequence divergence, the Henikoffs used clustering rather than an explicit evolutionary model. Clustering with different values of n, ranging from 45% to 90%, produces a parameterized set of matrices representing different degrees of sequence divergence. The clustering procedure also addressed the issue of sample bias.
منابع مشابه
Amino acid substitution matrices for protein conformation identification
Methods for alignment of protein sequences typically measure similarity by using substitution matrix with scores for all possible exchanges of one amino acid with another. Although widely used, the matrices derived from homologous sequence segments, such as Dayhoff’s PAM matrices and Henikoff’s BLOSUM matrices, are not specific for protein conformation identification. Using a different approach...
متن کاملPosition Dependent and Independent Evolutionary Models Based on Empirical Amino Acid Substitution Matrices
Evolutionary models measure the probability of amino acid substitutions occurring over different evolutionary distances. We examine various evolutionary models based on empirically derived amino acid substitution matrices. The models are constructed using the PAM and BLOSUM amino acid substitution matrices. We rescale these matrices by raising them to powers to model substitution patterns that ...
متن کاملAmino Acid Substitution Matrices Estimated by Maximum Likelihood
The present work describes protrates, a program that estimates amino acid substitution matrices and among-site substitution rates based on their likelihood for a given tree topology and a dataset of aligned proteins. The issue of producing maximum likelihood (ML) rate matrices over protein data have been adressed under the framework of general-purpose unbiased substitution matrices [1, 9], sinc...
متن کاملGenome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome
The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely employed. These methods make use of matrices to score alignments such as the amino acid substitutio...
متن کاملInconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues
The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direct path between two amino acids is shorter than a path passing through a third amino acid in the a...
متن کاملThe construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions
MOTIVATION Amino acid substitution matrices play a central role in protein alignment methods. Standard log-odds matrices, such as those of the PAM and BLOSUM series, are constructed from large sets of protein alignments having implicit background amino acid frequencies. However, these matrices frequently are used to compare proteins with markedly different amino acid compositions, such as trans...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Advances in protein chemistry
دوره 54 شماره
صفحات -
تاریخ انتشار 2000